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Human coronavirus NL63 (NL63), a member of the group I coronaviruses, may cause acute 
respiratory diseases in young children and immunocompromised adults. Like severe acute 
respiratory syndrome coronavirus (SARS-CoV), NL63 also employs the human angiotensin¬ 
converting enzyme 2 (hACE2) receptor for cellular entry. To identify residues in the spike protein 
of NL63 that are important for hACE2 binding, this study first generated a series of SI-truncated 
variants, examined their associations with the hACE2 receptor and subsequently mapped a 
minimal receptor-binding domain (RBD) that consisted of 141 residues (aa 476-616) towards 
the C terminus of the SI domain. The data also demonstrated that the NL63 RBD bound to 
hACE2 more efficiently than its full-length counterpart and had a binding efficiency comparable to 
the SI or RBD of SARS-CoV. A further series of RBD variants was generated using site-directed 
mutagenesis and random mutant library screening assays, and identified 15 residues (C497, 
Y498, V499, C500, K501, R518, R530, V531, G534, G537, D538, S540, E582, W585 and 
T591) that appeared to be critical for the RBD-hACE2 association. These critical residues 
clustered in three separate regions (designated Rl, Rll and Rill) inside the RBD, which may 
represent three receptor-binding sites. These results may help to delineate the molecular 
Received 22 July 2007 interactions between the S protein of NL63 and the hACE2 receptor, and may also enhance our 

Accepted 17 December 2007 understanding of the pathogenesis of NL63 and SARS-CoV. 


INTRODUCTION 

Coronaviruses consist of a large and diverse family of 
enveloped, positive-sense, single-stranded RNA viruses. 
They can infect a broad range of mammalian and avian 
species, causing a variety of diseases in the respiratory, 
gastrointestinal, hepatic and central nervous systems 
(Holmes & Lai, 1996). The outbreak of severe acute 
respiratory syndrome (SARS) in 2002-2003, which was 
caused by a highly pathogenic virus named SARS 
coronavirus (SARS-CoV), posed significant challenges to 
medical communities worldwide (Ksiazek et al, 2003; 
Peiris et al, 2003). Since then, two additional human 
coronaviruses, HCoV-NL63 (NL63) and HCoV-HKUl, 
have been identified (van der Hoek et al, 2004; Woo et al, 
2005). NL63 has been found to be associated with diseases 
in the upper and lower respiratory tracts, such as 


bronchiolitis, conjunctivitis, croup and pneumonia in 
young children and immunocompromised adults (Arden 
et al, 2005; Bastien et al, 2005; Ebihara et al, 2005; Gerna 
et al, 2006; Kaiser et al, 2005; van der Hoek et al, 2005, 
2006). So far, NL63 infections have been reported in 12 
countries across Europe, Asia and North America, 
indicating that it is circulating among the human 
population worldwide (Arden et al, 2005; Bastien et al, 
2005; Han et al, 2007; Koetz et al, 2006; Lau et al, 2006; 
Suzuki et al, 2005; Vabret et al, 2005). 

NL63 is a member of the group I coronaviruses, which also 
includes HCoV-229E (229E), feline infectious peritonitis 
virus 79-1146, feline enteric coronavirus 79-1683, canine 
coronavirus and porcine transmissible gastroenteritis virus 
(TGEV) (van der Hoek et al, 2004, 2006). The genome of 
NL63 contains 27553 nt and encodes the polyproteins la 
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and lb, spike (S), open reading frame 3 (ORF3), envelope 
(E), membrane (M) and nucleocapsid (N) proteins (van 
der Hoek et al, 2004) (Fig. 1). The S protein of NL63 is 
a 1356 aa, type I membrane glycoprotein, which contains a 
signal peptide (1-20 aa), an ectodomain (1-1302 aa), a 
transmembrane domain (1303-1325 aa) and a cytoplasmic 
tail (1326-1356 aa). The S protein can also be divided into 
SI (21-717 aa) and S2 (718-1356 aa) domains based on its 
similarity to S proteins of other coronaviruses (Jackwood 
et al, 2001; Sturman & Holmes, 1984). The SI domain of 
coronaviruses is responsible for binding to the cellular 
receptor, whilst the S2 domain is mainly involved in 
membrane fusion during viral entry into the host cell 
(Bonavia et al, 2003; Gallagher & Buchmeier, 2001). 
Human aminopeptidase N (hAPN or CD13) and human 
angiotensin-converting enzyme 2 (hACE2) have been 
identified as the functional receptors for 229E and SARS- 
CoV, respectively (Li et al., 2003; Yeager et al, 1992). As all 
previously known group I coronaviruses use the APN of 
their own host species as functional receptors (Tresnan & 
Holmes, 1998; Tresnan et al, 1996), and viral sequence 
analysis indicates that NL63 is most closely related to 229E 
(van der Hoek et al, 2004), it has been proposed that NL63 
may also employ hAPN as its receptor. However, hACE2 is 
a functional receptor for NL63 (Hofmann et al, 2005). 
This finding was quite surprising given that the S protein of 
NL63 shows only limited sequence similarities to that of 
SARS-CoV. 

Previous studies have demonstrated that the SI domains of 
coronaviruses may contain so-called receptor-binding 
domains (RBDs) that could directly bind to the cellular 
receptors. The RBD of the S protein of SARS-CoV has been 
mapped to a fragment consisting of aa 318-510 in the SI 
domain (Babcock et al, 2004; Wong et al, 2004). Whilst 
the first 330 residues at the N terminus of the mouse 
hepatitis virus (MHV) SI subunit are able to bind 
efficiently to carcinoembryonic antigen-related cell adhe¬ 
sion molecule 1 (CEACAM1), the RBDs of 229E and TGEV 
are all located at the C terminus of their SI domains 
(Ballesteros et al, 1997; Bonavia et al, 2003; Godet et al, 
1994; Laude et al, 1995). A recent study suggested that the 
central and C-terminal domains of the NL63 S protein are 
required for hACE2 association (Hofmann et al, 2006). 
However, the location of the NL63 RBD remains to be 
defined and the critical residues for receptor binding have 
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Fig. i. Schematic diagram of the genome organization of NL63 
and the deduced domains of its S protein. 


not been characterized. In this study, we generated a series 
of SI-truncated variants, tested their association with the 
hACE2 receptor and subsequently mapped an RBD that 
consists of 141 aa (aa 476-616) towards the C terminus of 
the NL63 SI domain. In addition, we identified 15 residues 
in the RBD that appear to be critical for receptor binding. 
Interestingly, these critical residues seemed to cluster into 
three separate regions (designated RI, RII and RIII) inside 
the RBD, suggesting that these sites might be associated 
with receptor binding. 

METHODS _ 

Construction of expression plasmids. A plasmid encoding hACE2 
with a c-myc tag at the N terminus and a C9 tag at the C terminus of 
hACE2 has been described previously (Li et al., 2005b). In this study, 
the domain structures of the NL63 S protein were defined using 
computer programs from the websites of SOSUI (http://bp.nuap. 
nagoya-u.ac.jp/sosui/sosui_submit.html) and Pfani (http://www. 
sanger.ac.uk/Software/Pfam/). To construct the plasmids expressing 
SI and its variants, we first synthesized a full-length, codon-optimized 
NL63 SI gene (encoding aa 21-717) according to the method 
described by Babcock et al. (2004). All of the NL63 Sl-truncated 
variants used in this study were generated based on this template. The 
RBD variants were generated by site-directed mutagenesis and/or 
random mutant library screening assays using a GeneMorph Random 
Mutagenesis kit (Stratagene) and overlapping PCR strategy. The 
cDNA inserts encoding the SI, RBD or a null (aa 12-327) fragment of 
SARS-CoV were amplified based on a codon-optimized S protein 
sequence of SARS-CoV described previously (Li et al., 2003). For 
expression and production of soluble SI or RBD variants, the cDNA 
inserts were cloned into a mammalian soluble protein expression 
vector, pSecTag/Hygro (Invitrogen), respectively, thereby producing 
c-myc- and His-tagged fusion proteins. In addition, some constructs 
were cloned into a modified pSecTag/Hygro-Ig vector that contains 
the mouse Igic-chain leader sequence and human IgG Fc fragment, 
producing hACE2-Ig, Sl-Ig, RBD-Ig or other Ig fusion proteins. All 
expression plasmids used in this study were confirmed by DNA 
sequencing. 

Expression and purification of proteins. To make soluble hACE2, SI 
or RBD proteins, 293T cells were transfected with the expression 
plasmids described above using a standard calcium phosphate 
transfection method. At 16-20 h after transfection, the cells were 
washed with PBS and cultured in serum-free medium (293 SFM II 
medium; Invitrogen). The protein-containing supernatants were 
collected on days 2, 3 and 4 post-transfection and pooled. An aliquot 
of the pooled protein supernatants was analysed by Western blotting 
(WB) to confirm the target protein expression. The supernatants were 
then passed through a 0.45 pm filter (Nalgene) and dialysed with 
binding buffer prior to affinity purification. The affinity column packed 
with charged nickel beads (Qiagen) was used to purify the 6 x His- 
tagged proteins. To purify the Ig fusion proteins (hACE2-Ig, S1—Ig or 
RBD-Ig), the supernatants were incubated with protein A/G-Sepharose 
beads at 4 °C for 16 h and then washed with 0.5 M NaCl in PBS. The 
samples were subsequently eluted with 50 mM sodium citrate/50 mM 
glycine (pH 2.0), neutralized with NaOH and dialysed against PBS as 
described previously (Sui et al, 2004). The purified proteins were 
concentrated with Centricon filters (Amicon) and dialysed against PBS. 
Protein concentration was measured with a protein assay kit (Bio-Rad) 
and aliquots of the proteins were stored at -80 °C. 

WB. To detect proteins by WB, purified proteins, cell culture 
supernatants or cell lysates were resolved by 10% SDS-PAGE and 
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transferred to a nitrocellulose membrane (Amersham Biosciences). 
The membrane was blocked for 1 h at room temperature with 
blocking buffer (5% skimmed milk, 0.05% Tween 20 in PBS) and 
incubated with primary antibody (anti-c-myc antibody at a 1:2000 
dilution or NL63-positive human serum at a 1:500 dilution) 
overnight at 4 °C, followed by three washes with washing buffer 
(0.05% Tween 20 in PBS). The blot was then incubated with 
secondary antibody (HRP-conjugated anti-mouse IgG or HRP- 
conjugated anti-human IgG at a 1:4000 dilution; Amersham 
Bioscience) for 1 h at room temperature and washed three times 
with washing buffer and once with PBS. The protein bands were 
detected using ECL reagents (Amersham Biosciences). The density of 
the protein band was quantified using Scion Image (National 
Institutes of Health, Bethesda, MD, USA). 

Immunoprecipitation (IP) assay. Cell lysate was prepared from 
hACE2-transfected 293T cells with lysis buffer consisting of 100 mM 
(NH 4 ) 2 S0 4 , 20 mM Tris/HCl (pH 7.5), 10% glycerol, 1% Cymal 5 
and protease inhibitor cocktail. To detect an S-ACE2 association by 
IP, purified SI or RBD proteins (1 pM) were mixed with 200 pg 
hACE2-C9 cell lysate and 2.5 pi anti-His antibody (Amersham 
Biosciences). The mixture was incubated at 4 °C overnight, followed 
by the addition of 30 pi protein A/G beads, and incubation was 
continued at 4 °C for 2 h. The precipitate was washed three times 
with Triton X-100 in PBS and resuspended in 40 pi protein loading 
buffer for detection of proteins by WB. Additionally, we performed IP 
using soluble hACE2-Ig proteins that were incubated with supernat¬ 
ant containing Sl-c-myc-His or RBD-c-myc-His protein and 
protein A/G beads, followed by the steps described above. 

Flow cytometry. To detect receptor expression and/or the binding 
interaction between SI (or RBD) and the receptors on the cell surface, 
flow cytometry (or FACS) was performed as described previously 
(Zhang et al, 1996). In brief, the target cells were detached with 
5 mM EDTA in PBS and washed twice with immunofluorescence (IF) 
buffer (1 % fetal bovine serum and 0.02% NaN 3 in PBS). The cells 
were incubated with anti-c-myc antibody (for the detection of 
receptor expression) or with Sl-Ig or RBD-Ig protein (for the 
detection of binding between SI or RBD and the receptor) on ice for 
1 h, followed by three washes with IF buffer. They were further 
incubated with fluorescein isothiocyanate (FITC)-conjugated anti¬ 
mouse IgG or FITC-conjugated anti-human IgG on ice for 30 min. 
The cells were washed three times with IF buffer, fixed with 1 % 
formaldehyde and analysed using a Beckman Coulter FC5000 
cytofluorometer. 


RESULTS 


The receptor-binding site is located in the 
C-terminal region of the SI domain 

To localize the region in the SI domain of NL63 that is 
responsible for hACE2 binding, we first constructed a panel 
of expression plasmids that encoded full-length SI ( 821 - 717 ) 
and six C-terminally truncated SI variants: S 2 i_ 6 i 6 > S 2 i_si 6 , 
S 2 i- 460 > S 2 i_ 4 io> S 2 i_ 3 6 o and S 2 i_ 260 (Fig. 2a). We then 
expressed and purified these SI variants from human 293T 
cells. We demonstrated by WB that both anti-c-myc 
antibody (Fig. 2b, top panel) and NL63-positive human 
serum (Fig. 2b, middle panel) recognized these proteins, 
suggesting that they folded properly and retained their 
antigenicity. We also expressed and purified three SI 
variants from SARS-CoV: full-length SI (S 12 _ 670 ), RBD 
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Fig. 2. Localization of the receptor-binding region of NL63 SI. (a) 
Diagram of the codon-optimized full-length SI and six C-terminally 
truncated SI variants constructed for this study, (b) Detection of 
purified SI variant proteins by WB and S1-hACE2 associations 
by IP. All of the purified c-myc- and His-tagged SI variants 
(500 ng each, from both SARS-CoV and NL63) were detected by 
WB using anti-c-myc antibody (top panel), whereas only the SI 
variants of NL63 were recognized by NL63-positive human serum 
(bottom panel). For the IP assay (middle panel), SI proteins 
(1000 nM each) were incubated with 100 pg C9-tagged hACE2 
protein lysate, anti-His antibody and protein A/G beads, followed 
by WB detection of the precipitated hACE2 receptor using anti- 
09 antibody. The data showed that hACE2 could bind to full- 
length SI and to S 2 i-6i6, but not to other C-terminally truncated 
SI variants of NL63. 


(S 318 — 510 ) an d a null fragment (Si 2 _ 327 ), which were 
detected by anti-c-myc antibody but not the NF63-positive 
human serum (Fig. 2b, top and bottom panels). To 
examine Sl-hACE2 associations, we performed an IP assay 
using anti-FIis antibody, followed by WB using anti-C9 
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antibody to detect precipitated ACE2 receptors. Fig. 2b 
(middle panel) shows that SI and S 21 _ 616 of NL63 were able 
to bind to hACE2, although their binding efficiencies 
appeared to be much lower than the SI and RBD of SARS- 
CoV. In contrast, we did not detect an association between 
hACE2 and the other SI variants (S 21 _ 516 , S 2 i_ 4 60 , S 2 i_ 4 10 , 
S 2 i- 36 o and S 2 i_ 260 ), suggesting that the region spanning aa 
516-616 is required for receptor binding, whereas the 
region at the very end of the C terminus (aa 617-717) is 
dispensable for receptor binding. These results are 
consistent with previous findings reported by others 
(Hofmann et al, 2006). 

Mapping a minimal RBD 

To map further the RBD, we constructed another panel of 
SI variants with deletions in the N- and/or C-terminal 
regions of the SI domain (Fig. 3a). Protein expression of 
three SI variants (S 311 _ 616> S 361 _ 6 i 6 and S 411 _ 616 ) was not 
detected in the supernatants of transfected 293T cells (data 
not shown), suggesting that the deleted regions could be 
important for stability, folding and/or secretion of the 
proteins. In contrast, the other five SI variants were 
expressed at high levels in the supernatants of transfected 
293T cells. We then purified these SI variants that were 
detected by WB using anti-c-myc antibody (Fig. 3b, top 
panel) and performed IP to detect an Sl-hACE2 
association. As expected, SI and RBD of SARS-CoV bound 
efficiently to hACE2, whilst the null fragment (S i2 _ 327 ) did 
not (Fig. 3b, bottom panel). As for the NL63 SI variants, 
we could detect an association between hACE2 and full- 
length SI (S 21 _ 7 i 7 ), S 2 io_ 7 i 7 , S 3 21 — 7 1 7 and S 461 _ 6 i 6 , respect¬ 
ively, but not between hACE2 and S 31 i _ 566 or S 517 _ 7 i 7 . In 
particular, we demonstrated that the S 46 i_ 6 i 6 variant was 
able to bind to hACE2 much more efficiently than full- 
length SI and had a binding efficiency comparable to that 
of SI or RBD of SARS-CoV (Fig. 3b, bottom panel). 

To determine whether S 46 i_ 6 16 is the minimal fragment that 
can bind to hACE2, we generated a further four truncated 
variants, S 476 _ 616 , S 491 _ 616 , S 461 _ 601 and S 461 _ 586 . Whilst the 
expression levels of S 476 _ 616 and S 461 _ 60 i were comparable to 
that of S 46 i_ 616 , the other two variants (S 49 i _ 616 and S 461 _ 586 ) 
did not express detectable proteins in the supernatants of 
transfected 293T cells (data not shown). We then purified 
S 47 6 - 6 i 6 an d S 46 i_ 60 i and used equal amounts of the proteins 
(Fig. 3c, top panel) for reciprocal IP assays in which either the 
S proteins (Fig. 3c, middle panel) or the hACE2 receptors 
(Fig. 3c, bottom panel) were precipitated and detected. In 
both IP assays, S 476 _ 616 bound to hACE2 as efficiently as 
S 4 6 i_ 6 i 6 > whilst S 461 _ 60 i did not bind. We also measured the 
Sl-hACE2 association by flow cytometry using similar 
amounts of Sl-Ig proteins of three NL63 S variants (SI, the 
minimal RBD S 476 _ 616 and the null fragment S 21 _ 460 ) and 
three SARS-CoV S constructs (SI, RBD and null) (Fig. 3d 
and e). The quantitative data presented in Fig. 3(e) showed 
dramatic differences in binding efficiency between NL63 
RBD-hACE2 and Sl-hACE2. However, we did not observe 


such differences among NL63 RBD-hACE2, SARS-CoV 
RBD-hACE2 and SARS-CoV Sl-hACE2. Therefore, the 
flow cytometry data were consistent with the IP data. 
Collectively, our study defined the fragment S 476 _ 616 as the 
minimal RBD of the S protein of NL63. 

Identifying critical residues for RBD-hACE2 
binding 

A previous study on the RBD of SARS-CoV suggested that 
cysteine and some negatively charged residues appear to be 
important for receptor association (Wong et al, 2004). To 
identify the critical residues in NL63 RBD for hACE2 
binding, we first generated 20 RBD variants (Table 1) in 
which a single cysteine (C) or a negatively or positively 
charged residue was replaced by alanine (A). These mutants 
comprised six cysteine (C497A, C500A, C516A, C550A, 
C567A and C577A), two aspartic acid (D484A and D538A), 
three arginine (R518A, R525A and R530A), three glutamic 
acid (E572A, E582A and E602A) and six lysine (K501A, 
K532A, K546A, K556A, K562A and K564A) point muta¬ 
tions. The expression plasmids encoding these RBD variants 
were transiently transfected into 293T cells and protein 
expression levels in the cell culture supernatants were 
detected by WB. Six of the mutants (C516A, C550A, 
C567A, C577A, R525A and K556A) either did not express or 
had very low expression levels in the supernatants of 
transfected 293T cells, whereas the remaining 14 RBD 
mutants exhibited relatively high expression levels (Table 1 
and Fig. 4a, top panel). Subsequently, similar amounts of 
protein were used for IP to detect RBD-hACE2 associations. 
As illustrated in the bottom panel of Fig. 4(a), alanine 
substitution of seven residues (C497, C500, D538, R518, 
R530, E582 and K501), respectively, could either abolish or 
dramatically decrease the receptor binding efficiency. 

To identify additional residues that are critical for RBD- 
hACE2 binding, we constructed a random mutant library of 
NL63 RBD and screened protein expression of 143 mutants 
by WB. Surprisingly, most of the mutants (118/143, 83%) 
had no or very low levels of protein expression in the 
supernatants of transfected 293T cells, suggesting that 
protein folding and/or stability of the RBD is vulnerable to 
mutagenic changes (data not shown). However, consid¬ 
erable levels of protein expression were observed for the 
remaining 25 mutants (Fig. 4b, top panels). Similar amounts 
of protein were used for the detection of RBD-hACE2 
association by IP assay (Fig. 4b, bottom panels). Eleven RBD 
mutants (1-5, 1-22, 1-31, 1-36, 1-74, 2-10, 3-2, 3-7, 3-42, 3- 
46 and 3-55) were found to have undetectable or 
dramatically decreased binding activities to hACE2 receptor, 
whilst the other 14 mutants (1-10, 1-11, 1-19, 1-23, 1-30, 1- 
38, 1-66, 1-72, 1-77, 1-80, 2-1, 2-20, 3-15 and 3-40) could 
bind efficiently to hACE2. Sequence analysis revealed that 
three mutants carried a single amino acid substitution ( 1-22 
with V531E, 3-42 with V499I and 3-46 with G534A), 
whereas the other eight mutants harboured two or more 
amino acid substitutions in RBD (Table 2). To pinpoint 
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Fig. 3. Mapping of the minimal RBD. (a) Diagram of NL63 full-length SI and eight SI variants constructed for this study, (b) 
Detection of protein expression by WB and S1-hACE2 association by IP as described in Fig. 2. Three of the SI variants (S 411 _ 6 i6, 
S 361 - 6 I 6 and S 311 _ 6 i 6 ) were not detectable in the supernatants of transfected 293T cells (data not shown) and no further assays 
were conducted on these variants. Purified SI proteins (500 ng each) were detected by WB using anti-c-myc antibody (top panel), 
and the precipitated hACE2 receptors were detected by WB using anti-C9 antibody (bottom panel), (c) Detection of minimal RBD- 
hACE2 association by reciprocal IP assays. Purified SI proteins (1000 nM each), which were detected by WB using anti-c-myc 
antibody (top panel), were either incubated with soluble hACE2-lg fusion protein and protein A/G beads followed by WB detection 
of the precipitated SI proteins using anti-c-myc antibody (middle panel), or were incubated with C9-tagged hACE2 protein lysate, 
anti-His antibody and protein A/G beads followed by WB detection of the precipitated hACE2 receptors using anti-C9 antibodies 
(bottom panel), (d, e) Examination of the binding efficiency of RBD and SI to hACE2 by flow cytometry. Human 293T cells 
transiently transfected with an empty plasmid (data not shown) or the plasmid encoding hACE2 were incubated with similar 
amounts of Sl-lg protein (d), followed by incubation with FITC-conjugated anti-human IgG Fc antibody and FACS analysis. The 
data are presented as the mean fluorescence value (e). 
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Table 1. Protein expression and receptor binding of NL63 
RBD variants generated by site-directed mutagenesis with 
alanine substitution 


Protein expression levels and receptor-binding efficiency were 
measured by WB. The density of protein bands was quantified using 
Scion Image. The density of the wt RBD band was set at 100% and 
those of mutants were normalized against wt. nd, Not done. 


RBD Residue Protein Receptor 

variant change expression (%) binding (%) 


100 

40 

48 

5 


100 

0 

0 

ND 


Wt None 

C497A C497—>A 

C500A C500—>A 

C516A C516—>A 

C550A C550—>A 

C567A C567—>A 

C577A C577—>A 

D484A D484—>A 

D538A D538—>A 

R518A R518—>A 

R525A R525—>A 

R530A R530—>A 

E572A E572—>A 

E582A E582—>A 

E602A E602—>A 

K501A K501—>A 

K532A K532—>A 

K546A K546—>A 

K556A K556—>A 

K562A K562—>A 

K564A K564—>A 


3 

ND 

3 

ND 

1 

ND 

92 

87 

80 

0 

66 

4 

0 

ND 

78 

0 

62 

118 

85 

0 

65 

91 

72 

10 

71 

117 

100 

133 

0 

ND 

69 

135 

79 

143 


further the role of individual amino acid substitutions in 
these mutants, we generated a series of new variants with 
only one amino acid substitution in each construct. 
Consequently, we identified an additional five residues 
(Y498, G537, S540, W585 and T591) that appeared to be 
critical for RBD-hACE2 binding (bottom panels of Fig. 4c 
and Table 2). The protein expression levels of two mutants 
(S523P and V515E) were undetectable in the supernatants 
and thus were not included in the IP assay (data not shown). 
Furthermore, we noticed that substitutions of some residues 
with different amino acids, such as E582A and E582K; V499I 
and V499L; K501A, K501M and K501N; T591A and T591I; 
and Y498H and Y498F exhibited similar effects on receptor 
binding (Tables 1 and 2 and Fig. 4), reinforcing the 
importance of these residues. Taken together, we identified 
15 residues in the NL63 RBD (C497, Y498, V499, C500, 
K501, R518, R530, V531, G534, G537, D538, S540, E582, 
W585 and T591) that appear to be important for the RBD- 
hACE2 association. 


DISCUSSION 

In this study, we mapped residues 476-616 as the minimal 
RBD of the S protein of NL63. Interestingly, all known 


RBDs of group I coronaviruses, including 229E, TGEV and 
NL63, are located towards the C terminus of the SI domain 
(Bonavia et al, 2003; Godet et al, 1994; Laude et al, 1995). 
In contrast, one group II coronavirus, MHV, has the RBD 
located at the N terminus of its SI subunit, and the SARS- 
CoV RBD is located near the central region of the SI 
domain (Kubo et al, 1994; Wong et al, 2004) (Fig. 5). 
Therefore, the relative location of RBD in SI of different 
coronaviruses seemingly correlates with their evolutionary 
and serological relationships. Whether or not this repre¬ 
sents a conservative trait in the evolution of coronaviruses 
remains to be addressed. 

Another interesting finding revealed by this study was the 
dramatic differences in receptor-binding efficiency between 
SI and RBD of NL63 (Fig. 3). Whilst two previous studies 
of SARS-CoV have also suggested that RBD could bind to 
hACE2 more efficiently than full-length SI, they only 
found differences of less than twofold (Babcock et al, 2004; 
Wong et al, 2004). In contrast, we observed that hACE2 
binding efficiency by the NL63 RBD was similar to that of 
SARS-CoV SI or RBD, but was about six times higher than 
that of NL63 S1 (based on the mean fluorescent value in 
flow cytometry assays, Fig. 3e). This sharp difference in 
receptor-binding efficiency might have important implica¬ 
tions for the pathogenesis, tropism and transmission of 
NL63 and SARS-CoV. It has been shown that hACE2 
expression in cell lines correlates with susceptibility to 
SARS-CoV S-driven infection (Esposito et al, 2006; 
Hofmann et al, 2004). We have previously demonstrated 
that the receptor-binding affinity of the SI protein of 
GD03T0013, a SARS-CoV strain associated with mild 
disease, was significantly lower than the SI of Tor2, a 
highly pathogenic strain isolated from a patient who died 
of SARS (Li et al, 2005b). Additional studies of the SARS- 
CoV-infected mouse model also strongly suggest that 
disease severity is correlated with S protein-mediated 
downregulation of ACE2 in vivo (Imai et al, 2005; Kuba 
et al, 2005). Therefore, the weak association between 
hACE2 and NL63 SI might contribute to mild clinical 
symptoms in NL63-infected patients. Furthermore, the 
strong association of NL63 RBD with hACE2 suggests that 
the folding structure of RBD might be different from that 
of SI. It may also indicate that some efficient binding sites 
may be buried inside the structure of SI. This implies that 
NL63 possesses the potential to use hACE2 as efficiently as 
SARS-CoV. Mutations accumulate over time, some of 
which might alter the protein folding and expose the 
efficient binding sites on the surface. This may lead to the 
emergence of new NL63 strains that would be highly 
pathogenic to humans. 

We have also conducted a series of mutagenesis studies on 
the NL63 RBD and identified 15 residues that appear to be 
critical for RBD-hACE2 interaction. We found that all of 
the six cysteine residues within the NL63 RBD are 
important for protein expression and/or hACE2 asso¬ 
ciation (Table 1 and Fig. 4a), which is similar to the roles of 
cysteine residues in the RBD of SARS-CoV (Wong et al, 
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Fig. 4. Identifying critical residues for ACE2 binding. The top panels in Fig. 4(a), (b) and (c) show the wild-type (Wt) and 
mutant RBD protein expression levels detected by WB using anti-c-myc antibody, whilst the bottom panels show the 
precipitated hACE2 receptors detected by WB using anti-C9 antibody. The asterisks (*) and the arrow (f) indicate the RBD 
variants that had no or dramatically decreased binding activity, respectively, to hACE2 receptor. In addition, the asterisk (*) 
indicates the RBD variants bearing only one amino acid substitution, whereas the arrow (f) indicates the mutant harbouring 
two or more amino acid substitutions, (a) Detection of protein expression and receptor binding of 14 NL63 RBD variants 
generated by site-directed mutagenesis, (b) Detection of protein expression and receptor binding of 25 NL63 RBD variants 
generated by random mutant library-screening assays, (c) Further characterization of individual residues in RBD variants with 
multiple mutations found in (b). 


2004). This previous study by Wong et al. (2004) identified 
two acidic residues (E452 and D454) that appeared to be 
important for the interaction between hACE2 and the RBD 
of SARS-CoV. Interestingly, two acidic residues (D538 and 
E582) in the NL63 RBD that appeared to be critical for 
receptor binding were also identified in our studies (Table 
1 and Fig. 4a). In addition, five basic residues in the NL63 
RBD (R518, R525, R530, K501 and K556) were identified 
to be critical for RBD protein expression and/or hACE2 
binding (Table 1 and Fig. 4a). Furthermore, we identified 
seven critical residues from the screening assay of the 
random mutant library of NL63 RBD: two valines (V499 
and V531), two glycines (G534 and G537), one tyrosine 
(Y498), one serine (S540) and one trytophan (W585). Our 
studies also suggest that, in addition to site-directed 


mutagenesis, screening of a PCR-generated random mutant 
library may become a powerful strategy for the identification 
of critical residues that are important for RBD-receptor 
binding or other protein-protein interactions. 

When we tried to understand how these critical residues may 
interact with the hACE2 receptor, it was interesting to find 
that, with the exception of R518, the other 14 critical residues 
clustered into three separated regions (designated RI, RII and 
RIII) in the RBD. As shown in Fig. 5, the RI region contains 
residues C497, Y498, V499, C500 and K501, and RII contains 
residues R530, V531, G534, G537, D538 and S540, whilst RIII 
contains residues E582, W585 and T591. These regions may 
represent three receptor-binding sites inside the NL63 RBD. 
Previous studies of the SARS-CoV RBD-hACE2 interaction 
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Table 2. Protein expression and receptor binding of NL63 
RBD variants generated by random library screening and site- 
directed mutagenesis assays 


Protein expression levels and receptor-binding efficiency were 
measured by WB. The density of protein bands was quantified using 
Scion Image. The density of the wt RBD band was set at 100% and 
those of mutants were normalized against wt. 


RBD Residue Protein Receptor 

variant change expression (%) binding (%) 


Wt None 

1-5 Y498—>H, F560—>Y 

Y498H Y498—>H 

F560Y F560—*Y 

1-22 V531—>E 

1-31 K501—>M, W585—>R 

K501M K501—>M 

W585R W585—>R 

1-36 K501—>N, G537—>R, 

T591—>1 
K501N K501—>N 
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Y498F Y498—>F 

3-7 W585—>R, S540—>N 

S540N S540—>N 

3-15 L476—>Q 

3-42 V499—>1 

3-46 G534—>A 

3-55 K501—>M, N604—>D 

N604D N604—>D 
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Fig. 5. Schematic diagram of the locations of the RBD of different 
coronaviruses and the proposed receptor-binding sites in the 
NL63 RBD. The locations of the RBDs of the S proteins of SARS- 
CoV, MHV, 229E, TGEV and NL63 are shown here. In addition, 14 
of the 15 critical residues identified in this study clustered into 
three separate regions in the NL63 RBD, designated regions I (Rl), 
II (Rll) and III (Rill), which may represent three receptor-binding 
sites. 


but may also advance our understanding of the pathogenesis 
of NL63 and SARS-CoV. 
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and the crystal structure of the SARS-CoV RBD-hACE2 
complex have identified a panel of residues in the RBD 
region that are critical for the RBD-hACE2 association, and 
demonstrated that some of these residues may play 
important roles in the cross-species transmission and 
pathogenesis of SARS-CoV (Li et al. , 2005a, b). Therefore, 
the current study and further characterization of NL63 RBD- 
hACE2 interactions, including examination of the crystal 
structure of the NL63 RBD-hACE2 complex may not only 
help our understanding of the molecular interactions 
between the S protein of NL63 and the hACE2 receptor, 
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